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Abstract. The best bound known on 2-locally distillable entanglement is that of 
Vedral and Plenio, involving a certain measure of entanglement based on relative 
entropy. It turns out that a related argument can be used to give an even stronger 
bound; we give this bound, and examine some of its properties. In particular, and 
in contrast to the earlier bounds, the new bound is not additive in general. We give 
an example of a state for which the bound fails to be additive, as well as a number 
of states for which the bound is additive. 



One of the central problems in quantum information theory is that of entangle- 
ment distillation [1] , the production of maximally entangled states from partially 
entangled states. The purpose of the present note is to give a new upper bound on 
the rate at which entanglement can be distilled. 

In general, if C is a given class of physical operations, we define C-distillable 
entanglement as follows: 

Definition. The C-distillable entanglement of a state p on a state space Va ® Vb 
is the maximum number Dc(p) such that there exists a sequence 

% ■ {V A ® Vs)® ni -> Vi ® V t 
of operations from C , with ni — » oo, 

— log 2 dimK: -> D c (p), 

m 

and 

F{%{p® n )) -> 1. 

See [2] for a discussion of why this is a valid definition of distillable entanglement. 
Here F(a) is the fidelity of the state <r, defined by 

F(a) = <f> + (V) f o-<I> + (V), 

* +( y)=-n=? E I**)- 

Vdim V ,^r-rT. 

Here § + (V) is a chosen maximally entangled state. 

The history of bounds on 2-locally distillable entanglement (for which C is gen- 
erated by local operations and two-way classical communication) involves a rather 
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curious phenomenon. The first bound, entanglement of formation, is most easily 
proved for 2-local operations. However, the second, stronger, bound of Vedral and 
Plenio ([3], see also [4]) applies to a larger class of operations, namely that of sep- 
arable operations. The bound of the present note carries this even further, both 
strengthening the bound and enlarging the set of allowed operations. This phe- 
nomenon is rather counter-intuitive, since enlarging the class of allowed operations 
would be expected to increase the distillablc entanglement. 

The new bound 

As in [4], the key to the new bound is the observation that if we are given a 
process that distills entanglement from p at a given rate and high fidelity, and 
apply the process to a different state a, then there is a limit to how much the 
fidelity can be reduced by doing so. Given an upper bound on the fidelity any 
process of that rate can obtain from a, we may be able to deduce that no process 
can obtain high fidelity from p. 

We will state this in some generality, to support possible future applications. 

Theorem 1. Let C be some class of operations. Suppose we are given a state a 
on Va <8> Vb and an increasing, left continuous, function a : M+ — > M+ such that 
any operation 

T : (Va® Vb)®" -» V <g> V 

from C satisfies 

i log 2 F T {o-® n ) < log 2 dim V). 

Then for any other state p, 

a(D c (p)) < S(p\\a), 

where 

S(p||a)=Tr(p(log 2 (p)-log 2 (a))) 
is the relative entropy of p and a. 

Proof. The crucial observation is that for any operator T the function Fq- is linear 
on density operators, and is bounded between and 1 . It follows that we can write 
it in the form 

F T (w) = Tr{F T u>) 

for some (uniquely determined) operator Ft such that Ft and 1 — Ft arc both 
positive. 

For e > 0, let n, V, T give a process from C with 

-log 2 dimV> D c (p)(l-e), 
n 

and 

Tr(F T p® n ) > 1 - e. 
On the other hand, by assumption, 

Tr(Frcr®") < 2~ nal - ™ log2 dim v \ 
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Lemma 1 below then tells us that 



>S(/ C, || cr ) > lirnsupa(— log 2 dimF) 

e^o n 

> limsupa((l — e)Dc(p)) 

e— 

= a(D c (p)). 

□ 

Remark. The above argument is a hybrid of the arguments of [3] and [4] . In par- 
ticular, it should be noted that the above argument does not require any assumption 
that the resulting bound be additive. 

Lemma 1. Let p and a be states on a common Hilbert space V, such that S(p\\a) 
is finite. For n € Z + and < e < 1, define 

i?(n,e)=inf{log 2 Tr(a®"7r)}, 

7T 

where tt ranges over positive operators on V® n with both tt and 1 — it positive, and 
such that 

Tr(p®"7r) > 1 - e. 

Then 

liminf lim — R(n,e) > — S(p\\a). 

Proof. From the condition on n, it follows that {tt, 1 — tt} is a POVM. Consequently, 
Uhlmann's monotonicity theorem ([5], theorem 1.5) tells us that, writing 

p p = Tr(p®"7r), 
p a =Tr( ( r®" 7 r), 

we have 

nS(p\\a) > p p (\og 2 (p p ) - log 2 ( P(T )) + (1 -p p )(log 2 (l - p p ) - log 2 (l - p a )). 
Now, this in turn is bounded below by 

-1 -pp\og 2 ( P<J ) - (1 -p <9 )log 2 (l -Pa). 
If we divide by n, the first and third terms will be negligible unless 

p a = Tr(<7®"7T) > i 

say. But such a tt could not possibly provide a counterexample to the lemma. It 
remains to consider the second term. But 

--Pplog 2 (p<r) > -(1 -e)-log 2 (2v). 
n n 



4 



E. M. RAINS 



It follows that 



-(1-e) lim -logM <S(p\\a). 

The lemma follows by taking the limit as e — > 0. □ 
Remark. Indeed, it is the case that 



lim lim —R(n,e) 

e— *0 n^oo n 



S(p\\a) 



([5], equation 1.31), in which the operator n may be assumed to be a projection. 
In particular the conclusion of theorem 1 is the strongest that can be made from 
its hypotheses. 

To obtain a bound, then, we need to find states a for which we can bound the 
fidelity. Let T denote the partial transpose operator of [6]. Let Cr be the set 
of positive-partial-transpose (p.p.t.) superoperators, that is completely positive, 
trace-preserving superoperators S such that the superoperator 



is also completely positive. It is not too difficult to see that any separable super- 
operator is also p.p.t., and thus any 2-local superoperator is p.p.t. Similarly, we 
will say a state p is p.p.t. if p r is positive semi-definite. Note that the creation of 
a p.p.t. state is a p.p.t. operation. We will write 'T-distillable entanglement" for 
"Cr-distillablc entanglement" . 

The key observation [7] is that p.p.t. states must have a L-distillable entan- 
glement of 0, because the fidelity of a p.p.t. state can be bounded away from 1. 
Indeed: 

Lemma 2. Let a be a p.p.t. state on a space V <g> V. Then 



S r :p^(S(p r )f 



F(a)< 



1 



dimV 



Proof. We have 



F(a) = Tr($ + (y)$+(F)V) 

= Tr(($+(y)$+(y) t ) r ( r r ). 



But 



(* + (V)*+(V)i) r = -^'£\ij)<ji\, 



and thus has eigenvalues of absolute value at most 1/ dimF. So 



Tr(($ + (^)$+(T/) t ) r ( j r ) < 



1 




1 



dimy 



dimV 



□ 



Remark. The new bound is essentially an update of the Vedral-Plenio bound to 
take this observation into account. 
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Theorem 2. For any state p and any p.p.t. state a on the same bipartite Hilbert 
space, 

Dr(fi) < S(p\\a). 

Proof. It suffices to show that the function a of theorem 1 can be taken to be 1. 
In other words, we must show that for any p.p.t. superoperator 

T : {V A ®V B )® n ^V®V, 

we have 

F ^ * dinV 

But the image of a p.p.t. operator under a p.p.t. superoperator is p.p.t., since 

TH r ^ r (^ r ), 



so the lemma applies. □ 
The statement that 



F T {<r® n ) < 



dimV 

for a p.p.t. superoperator T and a p.p.t. state a is a special case of the following: 

Theorem 3. Let T be a p.p.t. superoperator with output dimension K and asso- 
ciated fidelity operator Fq- . Then 

1 r 1 

< F$ < — , 

K ~ T ~ K 1 

where for Hermitian operators A and B, A < B means that B — A is positive 
semi- definite. 

Proof. It suffices to show that for any density operator p, 

^<Tr(^)<l. 

But 

Ty(F^p) = Tr(F TP r ) 

= Tr(<f>+(K)<f> + (K)tT(p r )) 
= Tr(<$> + (K)<$> + (K)\T r (p)f) 
= Ti((^+(K)^+(K^) r T r (p)). 

Since ~{l/K) < ($+(if)$+(iOt) r < (1/K), and T r (p) is a density operator, the 
result follows. □ 

Remarks. (1) It is an open question whether this inequality, together with the 
inequality 

< F T < 1 

which holds for all superoperators, can be used to give a stronger bound than that 
of Theorem 2, which essentially only uses the inequalities one at a time. (2) This 
statement is essentially a generalization of equation (16) of [4]. 
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Optimizing a 

To obtain the full strength of the bound of theorem 2, it is necessary to optimize 
the choice of a. In the sequel, we will say that a p.p.t. a is optimal for p if 

S(p\\a)=imnS(p\\<j') d = f B r (p) 

a 

where a' ranges over all p.p.t. states. 

Theorem 4. Suppose p is a positive definite state. Then the p.p.t. state a is 
optimal for p if and only if, setting 

K=l-D a Tr(plog(a)), 

where D a is the matrix derivative, we have 

<t t K t = and K T > 0. 

Proof. We note first that a must also be positive definite. Otherwise, 5(p||cr) would 
be infinite, but this is impossible, since S{p\\ d ^ ap ) < oo. Thus we must solve the 
optimization problem: 

Minimize S(p\\cr) subject to the constraints Trcr = 1, a > and a r > 0. 

Now, S(p\\a) is a convex function ([5], theorem 1.4), that is: 

S(p||a<7i + (1 - a)a 2 ) < aS(p||ai) + (1 - a)S(p\\a 2 ). 

Moreover, the set of p.p.t. density operators is convex. Thus a is optimal for p 
if and only if it satisfies the Karush-Kuhn- Tucker conditions ([8], Theorem 2.1.4). 
For the set of p.p.t. density operators, this becomes: 

J D CT Tr( (O log( ( r)) + A + X = 0, 

for some number A and Hermitian matrix K, where K r is positive and supported 
on the kernel of a r . Multiplying on the left by a and taking a trace, we find 

-A = Tr(aD CT Tr(plog(a))) 

= j t TT(p\og(a + ta)) 

= ^Tr(plog(l+t)) 
= 1. 

□ 

If p is only semi-definite, then a can be semi-definite, and the condition is some- 
what more complicated: 

l-D a Tr(ploga)=K + L, 

where oh = 0, a 1 ' K T = 0, and both L and K T are positive semi-definite. For 
simplicity, we will assume that p is definite in the proofs below, but in each case, 
the proof can be adapted to this more complicated case. 
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Remark. In the published version of this paper, the positivity condition on K 
was overlooked. The resulting condition is still valid (for p definite and not p.p.t.) 
when a is a smooth point on the boundary of the set of p.p.t. operators; the fact 
that the optimal a must be on the boundary forces K to have at least one positive 
eigenvalue. Unfortunately, a tensor product of two boundary points is never a 
smooth point on the larger cone, and thus most of the additivity results of the 
published version are invalid. 

One consequence of theorem 4 is that for a specific a, it is reasonably straightfor- 
ward to determine the set of p for which it is optimal. For instance, if we consider 
the p.p.t. state 
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we find that a is optimal for (among others) the state 
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where x = l/ln(73/23). However, a <B> a is not optimal for p <g> p. It follows 
immediately that 

Br{p®p) < 2B T (p). 

Thus the new bound is not additive. 

Since Br is certainly subadditive, we can regularize to a stronger bound: 

B r (p) d ^ lim -B r (p® n ). 

n— >oo Jl 

It is not clear how to compute this bound, however. It turns out, however, that 
there are a number of states for which the bound is additive, and in particular 
Br = Br- The simplest sufficient condition seems to be 

Theorem 5. Let p be a definite state such that there exists a optimal for p that 
commutes with p. Suppose further that 

D a (Tr(plog(a))f > 0. 

Then for any other state p' , 

Br{ P ®p') = Br{p)+Br{p l ). 



Similarly, if 
then 



D (T (Tr(plog(a))) r > -f, 



Br(p® n )=nB T (p) 



8 



E. M. RAINS 



for all n > 1 . 

Proof. Let cr' be a state optimal for p'; assume p' is definite (otherwise write it as 
a limit of definite states). We need only show that a® a 1 is optimal for p® p 1 . We 
can thus apply theorem 4. The crucial observation is that 

D a9a , Tr(p ® p' log(<7 ® a')) - £ CT (Tr(p) log(a)) ® D a ,(Tr(p') log(a')) 

whenever [p, cr] = 0. We may simultaneously diagonalize p and a. Then by basic 
properties of the matrix derivative, we have 

£W' Tr(p <g> p' log(cr O cr')) = ^ !ogO ® cr' + ip ® p') 

But the argument to the logarithm is block-diagonal; we can therefore consider 
each block independently. In other words, it suffices to consider the case in which 
a and p are scalars. But then 

j t logoff' + tpp') = p/<Jj t log(cr' + t P '), 

as desired. 

Then the first condition of theorem 4 requires 

(a ® a') T {D a (Tr(p) log(a)) ® £> CT ,(Tr(p') log(a')) - l) r = 0, 
which is straightforward to verify. The other condition is that 

( J D ff (Tr(p)log( C r))®^(Tr(p')log( ( T')) r < 1. 
But this follows since 

# CT (Tr(p)log(cr)) r >0 
D^(Tr(p')log(cr')) r <l. 

The self-additivity claim follows similarly. □ 

Remark. Note that when [p, a) = 0, Do- (Tr (p log cr)) = per -1 . 
Remark. This, again, is weaker than the published version; we thank R. F. 
Werner and K. G. H. Vollbrccht for telling us of the counterexample they found [9]. 
In particular, this applies when p = cr, so 

Corollary 1. If p is p.p.t., then for any other state p' , 

B T {p®p')=B T {p'). 

Exploiting symmetries 

The most powerful tool for computing an optimal cr seems to be the following 
result: 
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Theorem 6. Let p be an arbitrary state on Va®Vb- Let G be a subgroup of 
U(Va) <8> U(Vb) consisting of matrices Ua <S> Ub such that 

(U A ® U B )p(U A ® U B ) ] = P- 
Then there exists some a optimal for p with 

(U A ® UbHUa ® E^)* = a 

for all U A ®U B e G. 

Proof. There certainly exists some a optimal for p. The point, then, is that for any 

U A , U b , 

(U a ® U B )a(U A ® £7 B )t 

is p.p.t., and optimal for 

(Ua®Ub)p(Ua®U b ) ] . 

In particular, this is true for Ua®Ub € G. But G is a closed subgroup of a compact 
Lie group, so there is a unique invariant probability measure on G. Define 

a' = E UA ® UBeG (U A ® U b )o-(Ua ® Ub) 1 ■ 
By convexity, a' is p.p.t., and 

S(p\\a') < S(p\\a). 

Since cr is optimal for p, so is a'. Since er' is clearly preserved by G, we are done. □ 
Example 1 . Let p be a state of the form 

a$ + (y)$ + (y)t + b, 

with p r > (an isotropic state). Then we may take G to be the group 

{U® U : Ue U(V)}. 

But, in fact, G forces p, and thus a to have that form. So there are numbers c and 
d with 

( j = c$ + (y)$+(T/) t + rf. 

Then [p, cr] = 0, and it is straightforward to solve for c and d: 

a = $+(V)$ + (V) t + . 

dimF + 1 V ' K ' dimy(dimV + 1) 

In other words, a is the isotropic state of fidelity 1/ dimF. Computing S(p\\a), we 
find: 
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Theorem 7. Let p be the isotropic state of fidelity F > and dimension K. Then 

B r (p) = log 2 K + Flog 2 F + (1 - F) log 2 ((l - F)/{K - 1)). 
Moreover, for all n > 1, 

B r (p® n ) =nB r (p) 
The second statement follows from theorem 5; we readily verify 

(D a Tr(ploga)) r > -1. 

Remark. (1) If p is maximally entangled, then, while there exists an optimal a 
with full symmetry, there also exist optimal a with much smaller symmetry groups. 
(2) Since Br is a lower bound on entanglement of formation (by the same proof used 
in [3] to show that the Vedral-Plcnio bound is less than entanglement of formation) , 
this theorem gives a lower bound on the entanglement of formation of an isotropic 
state. 

Example 2. In Example 1, the symmetries of p were enough to force [p, a] = 0, 
and thus to force the bound to be additive with respect to p (modulo the derivative 
bound condition). Here we use that idea to give a large family of states for which 
the bound is additive. 

Let G be a finite abelian group. For an clement g e G and a character \ ■ G —* C, 
define the "generalized Bell state" associated to g and x to be 

v g,x = -/= Yl xWlfc. h ~9)- 
V M hec 

If G = Z 2 , we recover the usual Bell states. In general, the generalized Bell states 
for a given group G form an orthonormal basis of maximally entangled states. 

Theorem 8. Suppose p is a mixture of generalized Bell states for the group G. 
Then there exists a optimal for p which is also a mixture of generalized Bell states 
for G, and thus [p, a] = 0. 

Proof. Let Vq be the Hilbert space with basis \h) for /ieG. Define operators X(g) 
and Z(x) by 

X(g)\h) = \h + g), 
Z(x)\h)= X (h)\h). 

Let G be the group generated by X(g) ® X(g) for all g and Z(x) <8> Z(x) for all x- 
Then each generalized Bell state is a common eigenvector for G, since we compute 

x{g') ® x{g>g,x = x(g')v g , x 

Z(x')®Z( x 7 )v g , x =x'(9)v g , x . 

Moreover, for any two generalized Bell states, there exists an element of G that 
distinguishes them. From this, it follows that a state is invariant under G if and 
only if it is a mixture of generalized Bell states. Since any two such mixtures 
commute, it follows that [p, a] = for some a optimal for p. □ 
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For G = Z 2 , we can explicitly solve for <r; if 

p = avoovlo + bv 01 v f 01 + cv 10 v\ + dvuv^, 
with a>^>b>c>d, then 

a = l VonV ™ + w^a) V0lV ^ + 2(T^) Vlo4 ° + whrf 11 ^' 

and 

Bv(fi) = 1 + alog 2 a + (1 - a) log 2 (l - a), 

agreeing with the bound of [4] . In general, it appears to be rather more difficult to 
compute the optimal a analytically. Of particular interest would be the case Z 2 , 
corresponding to mixtures of tensor products of the usual Bell states. 



Maximally correlated states 

We close with consideration of a class of states for which it is a reasonable 
conjecture that the p.p.t. bound is not only tight, but is in fact equal to the 1- 
locally distillable entanglement. This also gives examples of states for which the 
bound is additive, even though theorem 5 does not apply. 

Say that a state p on V <£> V is maximally correlated if for any classical measure- 
ment on V, Alice and Bob will always obtain the same result. In other words, p is 
of the form 

P = ^L a ij\ ii )(jj\ 

i,3 

for some Hermitian, trace 1 operator a on V. 
Theorem 9. If p is maximally correlated, then 

B r (p) = S(Tr A (p))-S(p). 

For any other maximally correlated state p' , 

Bt{p®p) = Bt{p)+B v {p). 



Proof. We first show that the state 

i 



is optimal for p. Certainly, a is p.p.t. (indeed, it is manifestly separable), so it 
remains to apply the Karush-Kuhn- Tucker condition. Of course, a is only semi- 
definite, so we must use the more complicated condition following Theorem 4. 
We have 

AxTr(plogO)) = 'Y^OLi j f{a u ,OLjj)\ii)(ji\, 

i,j 

where 

logq-log/3 

f{a,P) = o ' 

a — p 
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except that 



We thus choose 



f(a,a) = -. 

a 



K = -^2 a vf (a« , otjj ) | *»> (i j I 

+ XJ( 1 - A «)ly><y'l. 

for suitable numbers Ay = A 3 j. We observe that 

cri = a T K T = 0, 

so it remains to verify that L,K r > 0. Clearly L > if and only if each Ay > 0, 
while i-C r is essentially a block matrix with 2x2 blocks 

1 - Ay -aijf(aii,ajj) 
-ajif(au,ajj) 1-Aij 

We thus obtain the conditions 

< Ay < 1 

1 - Ay > | ay 1/(0^,0^). 
We can choose Ay satisfying these conditions if and only if 

\a ij \f{au,a jj ) < 1. 

But, since a is positive, 



\ff \\ 2 <r ft ^ _ /31og(/3) 2 

^tjlJ \ a tn a jj ) ) a ti a jj JKy-ui a jj) ~ ^ _ ^2 



where /3 = ay / . 

Additivity follows from the fact that the tensor product of maximally correlated 
states is maximally correlated. □ 

In particular, 

Corollary 2. If p is a pure state, then 

B r (p)=E f (p). 



Proof. Increasing the dimension of Va or Vb as necessary to make the dimensions 
equal, we find that p is locally equivalent to a maximally correlated state. The 
result then follows immediately from theorem 9. □ 

Here, in fact, D\{p) = Br(p), by the fact [10] that pure states can be distilled 
at a rate equal to their entanglement of formation. We also have 
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Corollary 3. Let p be a maximally correlated state on a 2 x 2 dimensional Hilbert 
space. Suppose Tta(p) = 1/2- Then 

D 1 {p)=B T (p) = l-S{p). 

Proof. This is equivalent to a mixture of two Bell states. As remarked in [4], we 
can distill this using classical error correcting codes, and find 

D 1 {p) = l-S{ P ). 

□ 

Since these two examples are at opposite extremes in a certain sense, the follow- 
ing conjecture seems reasonable: 

Conjecture. For any maximally correlated state p, 

D 1 {p)=B v {p). 
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